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1 • A system (1) for performing automatic dubbing on an incoming audio- 

visual stream (2), said system (1) comprising: means (3, 7) for identifying the speech 
content in the audio-visual stream (2); a speech-to-text converter (13) for converting the 
speech content into a digital text format (14); a translating system (15) for translating 
the digital text (14) into another language or dialect; a speech synthesizer (19) for 
synthesizing the translated text (18) into a speech output (21); and a synchronizing 
system (9, 12, 22, 23, 26, 31, 33, 34, 35) for synchronizing the speech output (21) to an 
outgoing audio-visual stream (28). 

2. The system (1) of claim 1, containing a voice profiler (10) for generating 

voice profiles (1 1) for the speech content and for allocating the appropriate voice profile 
(1 1) to the translated text (14) for speech output synthesis. 

3 - 1116 system (1) according to claim 1 or claim 2, wherein the system (1) 

contains a source of time data (4) for the allocation of timing information to the audio 
and video contents (4, 5) for later synchronisation of these contents. 

4. The system (1) according to any preceding claim, wherein the translation 
system (15) contains a language database (17) with a plurality of different languages 
and/or dialects and means for selection of a language or dialect from this database (17) 
into which the digital text (14) is to be translated. 

5. The system (1) according to any preceding claim, wherein the system (1) 
contains an open-caption generator (29) for the creation of open captions (30) using the 
digital text (14) and/or the translated digital text (18), for inclusion in an outgoing 
audio-visual stream (28). 
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6. An audio-visual device comprising a system (1) according to any of the 

preceding claims. 

5 7. A method for automatic dubbing of an incoming audio-visual stream (2), 

which method comprises: identifying the speech content in the audio-visual stream (2);' 
converting the speech content into a digital text format (14); translating the digital text 
(14) into another language or dialect; converting the translated text (18) into a speech 
output (21); synchronizing the speech output (21) to an outgoing audio-visual stream 
10 (28). 

8- The method of claim 7, wherein voice profiles (1 1) for the speech 

content are generated and allocated to the appropriate translated text (18) in the 
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synthesis of speech output (21) 

9. The method of claim 7 or 8, wherein a copy of the speech content is 

diverted from the audio-visual stream (2) or from an audio content of the audio-visual 
stream (2). 



20 10. The method of claim 7 or 8, wherein the speech content in the audio- 

visual stream (2) is separated from the remaining audio-visual stream or from an 
remaining audio content of the audio-visual stream (2). 
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11. The method according to any preceding claim, wherein an audio/video 
combiner (26) inserts the speech output (21) into the outgoing audio-visual stream (28), 
replacing the original speech content. 

12. The method according to any preceding claim, wherein an audio/video 
combiner (26) overlays the speech output (21) into the outgoing audio-visual stream 



30 (28). 



